Machine Learning Techniques for Sentiment Analysis of Code-Mixed and Switched Indian Social Media Text Corpus - A Comprehensive Review
نویسندگان
چکیده
A comprehensive review of sentiment analysis for code-mixed and switched text corpus Indian social media using machine learning (ML) approaches, based on recent research studies has been presented in this paper. Code-mixing switching are linguistic behavior shown by the bilingual/multilingual population, primarily spoken but also written communication, especially media. involves combining lower units like words phrases a language into sentences other (the base language) code-switching to another language, length one sentence or more. In code-mixing switching, bilingual person takes more from introduces them while communicating that mode. People nowadays express their views opinions several issues multilingual countries, people English as well native languages. Several reasons can be attributed code-mixing. Lack knowledge particular subject, being empathetic, interjection clarification some name. Sentiment monolingual content carried out last two decades. However, during years, Natural Language Processing (NLP) focus shifted towards exploration data, thereby, making code mixed an evolving field research. Systems have developed ML techniques predict polarity fine tune existing models improve performance.
منابع مشابه
Sentiment Identification in Code-Mixed Social Media Text
Sentiment analysis is the Natural Language Processing (NLP) task dealing with the detection and classification of sentiments in texts. While some tasks deal with identifying presence of sentiment in text (Subjectivity analysis), other tasks aim at determining the polarity of the text categorizing them as positive, negative and neutral. Whenever there is presence of sentiment in text, it has a s...
متن کاملExperiments with POS Tagging Code-mixed Indian Social Media Text
This paper presents Centre for Development of Advanced Computing Mumbai’s (CDACM) submission to the NLP Tools Contest on Part-Of-Speech (POS) Tagging For Code-mixed Indian Social Media Text (POSCMISMT) 2015 (collocated with ICON 2015). We submitted results for Hindi (hi), Bengali (bn), and Telugu (te) languages mixed with English (en). In this paper, we have described our approaches to the POS ...
متن کاملJU_KS@SAIL_CodeMixed-2017: Sentiment Analysis for Indian Code Mixed Social Media Texts
This paper reports about our work in the NLP Tool Contest @ICON-2017, shared task on Sentiment Analysis for Indian Languages (SAIL) (code mixed). To implement our system, we have used a machine learning algorithm called Multinomial Naïve Bayes trained using n-gram and SentiWordnet features. We have also used a small SentiWordnet for English and a small SentiWordnet for Bengali. But we have not ...
متن کاملPOS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments
We discuss Part-of-Speech(POS) tagging of Hindi-English Code-Mixed(CM) text from social media content. We propose extensions to the existing approaches, we also present a new feature set which addresses the transliteration problem inherent in social media. We achieve an 84% accuracy with the new feature set. We show that the context and joint modeling of language detection and POS tag layers do...
متن کاملPreparing Bengali-English Code-Mixed Corpus for Sentiment Analysis of Indian Languages
Analysis of informative contents and sentiments of social users has been attempted quite intensively in the recent past. Most of the systems are usable only for monolingual data and fails or gives poor results when used on data with code-mixing property. To gather attention and encourage researchers to work on this crisis, we prepared gold standard Bengali-English code-mixed data with language ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2022
ISSN: ['2158-107X', '2156-5570']
DOI: https://doi.org/10.14569/ijacsa.2022.0130254